Convergence to Global Optimality with Sequential Bayesian Sampling Policies
نویسندگان
چکیده
We consider Bayesian information collection, in which a measurement policy collects information to support a future decision. This framework includes problems in ranking and selection, reinforcement learning, and continuous global optimization. We give sufficient conditions under which measurement policies achieve asymptotically minimal expected loss. Achieving asymptotically minimal expected loss implies that the sequence of decisions believed to be the best under successive posterior distributions converges almost surely to the set of globally optimal decisions. This condition is most useful for adaptive sequential sampling policies, which often perform better than nonadaptive policies, but whose convergence is often difficult to confirm by other means. We apply these sufficient conditions to show convergence to global optimality for three previously proposed ranking and selection policies: OCBA for linear loss, LL(S), and LL1. We also show how this sufficient condition may be applied to knowledge-gradient policies.
منابع مشابه
Asymptotic Optimality of Sequential Sampling Policies for Bayesian Information Collection
We consider adaptive sequential sampling policies in a Bayesian framework. Under the assumptions that the sampling distribution is from an exponential family and that the number of distinct measurement types is finite, we give sufficient conditions for an adaptive sampling policy to achieve asymptotic optimality. Here, asymptotic optimality is understood to mean that the limit of the expected l...
متن کاملProgressive Global Random Search of Continuous Functions I
A sequential random search method for the global minimization of a continuous function is proposed. The algorithm gradually concentrates the random search effort on areas neighboring the global minima. A modification is included for the case that the function cannot be exactly evaluated. The global convergence and the asymptotical optimality of the sequential sampling procedure are proved for b...
متن کاملSequential Optimality Conditions and Variational Inequalities
In recent years, sequential optimality conditions are frequently used for convergence of iterative methods to solve nonlinear constrained optimization problems. The sequential optimality conditions do not require any of the constraint qualications. In this paper, We present the necessary sequential complementary approximate Karush Kuhn Tucker (CAKKT) condition for a point to be a solution of a ...
متن کاملConsistency of Sequential Bayesian Sampling Policies
We consider Bayesian information collection, in which a measurement policy collects information to support a future decision. This framework includes ranking and selection, continuous global optimization, and many other problems in sequential experimental design. We give a sufficient condition under which measurement policies sample each measurement type infinitely often, ensuring consistency, ...
متن کاملA Globally Convergent Primal-dual Interior-point Filter Method for Nonlinear Programming: New Filter Optimality Measures and Computational Results
In this paper we prove global convergence for first and second-order stationarity points of a class of derivative-free trust-region methods for unconstrained optimization. These methods are based on the sequential minimization of linear or quadratic models built from evaluating the objective function at sample sets. The derivative-free models are required to satisfy Taylor-type bounds but, apar...
متن کامل